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YEAST ARTIFICIAL CHROMOSOMES AND THEIR USE IN THE 

CONTROL OF GENE EXPRESSION 
Field of the Invention 

This invention relates to yeast artificial chromosomes 
5 and their manipulation and transfer into cells and animals, 
to exploit the control of gene expression, and also to the 
resulting cells and animals. 
Background of the Invention 

The ability to transform suitable hosts with foreign 
10 DNA, and thus to express gene products not normally 
produced by the host, is an important goal of 
biotechnological research. Microorganisms can be used to 
produce desired proteins, while higher animals having 
desirable characteristics have also been produced. For 
15 example, EP-A-0264166 , WO-A-8800239 , WO-A-8801648 , WO-A- 
9005188 and WO-A-9211358 describe the use of lactating 
animals to express foreign proteins which are produced in 
the milk and can be isolated therefrom; this provides a 
very satisfactory, controlled source of pure protein. 
20 Techniques to transfer cloned DNA into mammalian cells 

and transgenic animals have greatly facilitated the study 
of gene regulation and expression. Gene transfection 
experiments have also highlighted the fact that the limited 
size cf many cloned DNA molecules prevents the efficient 
25 use of the numerous distant regulatory sequences believed 
to control expression. 

More particularly, in order to investigate regulation 
of complex loci and chromosomal domains harbouring clusters 
of genes, it is essential to introduce very large pieces of 
3 0 DNA into cells and animals. The conventional approaches 
used in transgenic animal technology have limitations, 
making it difficult to introduce DNA fragments which are 
greater than 100 kb; see Bruggemann et ai, Eur. J. Immunol. 
2JL: 1323-1326 ( 1991). For example, while germ line- 
3 5 deoendent aenomic imorintina mav affect large areas in 
which a number of genes may be similarly regulated, there 
is no efficient method to srudv such "imcrinted" domains. 
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A satisfactory technique fcr introducing large DNA 
fragments would allow progress, and also facilitate tne 
analysis cf other complex loci such as the T-compiex 
(harbouring specific deletions) for which a number cf genes 
5 have been mapped which are crucial for mammalian 
development. In order to obtain a better understanding cf 
the regulation of eukaryotic genes, it would be desirable 
to express these genes in their authentic genomic context 
after cloning and re-introduction into cells. 

The expression of mammalian genes is controlled at 
various levels: the genomic context (influence of 
neighbouring genes) , regulatory elements proximal to the 
exons (e.g. promoters), regulatory sequences downstream of 
the termination codon (polyadenylation site) and regulatory 
motifs further away from coding sequences (e.g. enhancers) . 
For immunoglobulin genes, it has been shown that expression 
of transgenes is poor when enhancer motifs are missing, and 
that these can be several thousand base-pairs away (25 kb 
for the heavy chain 3 '-enhancer) from the nearest exon. 

Mouse models have been established tc address the 
Question of the immunogenic ity of chimaeric, foreign and 
authentic antibodies used for therapeutic purposes; see 
Bruggemann et al, J. Exp. Med. 170:2153 (1989). It became 
clear that only authentic proteins escape the surveillance 

25 of the immune system. 

■ The techniques currently used for making human 
antibodies involve either in vitro immunisation and 
immortalisation of human lymphocytes or genetic 
engineering. The selection of rare specific antibody- 
30 producing human lymphocytes outside the body is difficult 
and, once the lines are obtained, their yield and stability 
are poor; see Bcrrebaeck, Immunol. Today 9:355 (198S). 
Genetic engineering (also termed "humanisation" cf rodent 
antibodies) firstly has tc be done for each individual 
mouse cr rat antibody cf therapeutic use and secondly does 
net yield completely human antibodies; see Riechmar.n et al, 
Nature 322:323 (1938). "Humanising" existing rodent 
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antibodies already approved for therapy is currently the 
most successful way to obtain less immunogenic reagents. 
However; it would be a considerable improvement to have a 
mouse strain available which makes authentic human 
5 antibodies after immunisation with human materials. 

A repertoire of immunoglobulins has been obtained from 
transgenic mice carrying inserted human antibody gene 
segments in germ line configuration; see WO-A-9004 036 and 
Bruggemann et al, PNAS 86:6709 (1989). A human mini IgH 

10 locus has been constructed with variable region genes (Vs) , 
diversity segments (Ds) , joining segments (Js) and the p. 
constant region gene (C^) - The human gene segments 
rearrange in the lymphoid tissue of these mice (VDJ-Cm) and 
antibodies with human y. heavy chains can be obtained after 

15 immunisation. However, the level of human IgM as opposed 
to mouse IgM is low, and specific hybridomas with human y 
chains are rare after immunisation. A further complication 
is that most of those cells that produce human heavy chain 
also secrete endogenous mouse Ig. This means that 

20 rearrangement of human y. does not stop endogenous 
rearrangement; in other words, allelic exclusion is not 
achieved. Furthermore, the actual repertoire size of the 
produced human antibodies is unknown but might be small as 
the IgH construct contains only a limited number of V ana 

25 D segments. 

Srivastava ex al, Gene 103:53-59 (1991), describe 
Diasmids which permit the insertion of neomycin-resistance 
gene into the human DNA insert or the vector arm of a YAC. 
In the latter case, the plasmid alsc contains a LYS2 gene 

3 0 fcr selection in a yeast host. The URA3 gene is then, 
replaced by a new insert, thus inactivating URA. 

Bothsteir. et al, Science 240:14 39-1443 ( 1988), 
describe the practical advantages cf the yeast organism fcr 
cloning and manipulating large DNA molecules. Green et a^, 

3 5 Science 2 50 :94-9 3 (199G) , report the cloning cf segments o: 
up to one million base-pairs m yeast ceils ir, artificia- 
cr.romosor.es (YACs., , allowing the long-range mapping ant 
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analysis cf complex genomes. Yeast vectors for cloning cf 
large DNA molecules combine two features; plasinici sequences 
for their propagation in E. coii and yeast speciric 
seouences to ensure the replication and maintenance cr a 
5 linear molecule when grown in yeast. 

Nevertheless, the problem of producing large molecules 
(of which specific examples are immunoglobulins, Factor 
VIII and Factor IX) on a commercial scale, remains. At the 
molecular level, it would be desirable to introduce large 

10 DNA molecules containing single genes of considerable size, 
in order to facilitate correct expression. For example, 
the human Ig heavy chain locus accommodating all Vs, Ds, Js 
and C regions is estimated to spread over 3 00 0 kb of DNA; 
the Factor VIII gene is almost 200 kb in size with 23 

15 exons. At present, high level of expression is rarely 
achieved by the introduction of cDNAs and engineers genomic 
"minigenes" into transgenic animals. 
Summary of the Invention 

Surprisingly, it has now been found that specific DNA 

20 of considerable length can be introduced into ES cells via 
a YAC , without introducing DNA from the yeast. The need to 
purify the YAC out of yeast is thus avoided, and 
transformation is essentially foolproof. This discovery is 
applicable to the introduction of such DNA into other cells 

25 also. 

The appropriate YAC, for use in the present invention, 
includes a foreign gene or gene locus (i.e. including one 
or more genes or gene segments) of at least 100 kb and also 
a marker gene which allows selection in cells that are not 

3 0 prokaryotic or yeast cells. This construct can be obtained 
by integration of the marker gene, e.g. Neo , into a YAC 
including the foreign gene. 

In a further aspect cf the invention, ES cells or 
other cells are transformed by the marked YAC, and thus can 

35 be selected and inserted into, say, animals which 
consequently express human immunoglobulin or other genes. 
In particular, a transgenic lactaring, cvine or bovine 
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animal, mouse or other rodent:, or any non-human animal, may 
contain a foreign gene or gene locus of considerable 
length, e.g. at least 100 kb, often at least 300 kb, but 
also 1 Mb or more, if required. The animal may express the 
5 product of another animal, e.g. a human product, but not 
its own corresponding product. 
Description of the Invention 

The available YACs contain a marker gene that allows 
for selection in prokaryotic cells, e.g. for ampicillin- 

10 resistance, but do not contain a marker gene that would 
allow for selection for integration in eukaryotic, ES, 
somatic or mammalian cells. This invention utilises the 
recombination proficiency of yeast to introduce an 
antibiotic resistance into the YAC by recombination. Fcr 

15 example, neomycin-resistance gene controlled by the 
thymidine kinase promoter ( tk neo) will allow selection in 
mammalian cells. Other suitable markers are hygromycm and 
HPRT-resistance . Depending on the circumstances, it may be 
desirable to prepare YACs with different markers, for 

20 integration. 

The oresent invention is based in part on the 
realisation that yeast vectors with dominant selectable 
marker genes allow targeted integration into left 
(centromeric) and right ( non-centromer ic) YAC arms as well 

25 as alterations to human-derived insert DNA . In 
transformation experiments, integration proceeds 
exclusively by homologous recombination, although yeast 
prefers linear ends of homology for predefined insertions. 
Targeted regions can be rescued which expedite the cloning 

30 of internal human sequences and the identification of 5* 
and 3' YAC / insert borders. Integration of, say, 
neomycin-resistance gene into various parts of the 
allows the transfer and stable integration, and ais: 
rescue, of large DNA molecules into a variety of mammal iar 

35 cells including embryonic stem cells. 

The present invention utilises YACs as cloning 
vehicles fcr the complete integration cz DNA or a lengtr 
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that has previously ceen difficult :c transfer. Tne 
invention also allows specific modification of the 3KA. 

It inav be necessary tc enlarge an existing YAC 
containing heavy and light chain genes. A variety cf human 
heavy and light chain-containing cosmids is available; they 
may be added cn tc the YAC by homologous, site-specific cr 
other integration. Further, YACs containing different or 
overlapping parts of defined loci may be crossed, in order 
to obtain a contiguous DNA molecule in one YAC. This 
allows a large part of the human heavy and light chain gene 
clusters to be reconstituted, and to obtain transgenic 
animals which make authentic human antibodies. 

Similarly, a Factor IX-containing YAC may be modified 
in order to direct expression, for example, in milk. This 
15 may involve exchanging the Factor IX promoter for a milk 
gene promoter, e.g. the murine whey acidic protein gene 
-urometer, by homologous recombination in yeast. Factor 
VIII is another product that can be produced in the same 
way. 

2 0 It is an important feature of the invention that the 

marker gene is incorporated into YAC in an active for:?., in 
order to allow selection. For this purpose, the YAC 
without that marker is subjected to integration with a 
plasmid containing the marker gene and a - sequence outside 

25 the foreign gene, e.g. the ampicillin-resistance gene. 
This may lead to a YAC containing multiple copies cf both 
markers, and duplication at least is preferred, but the 
important point is that, say, neomycin-resistance can be 
observed . 

The starting materials and techniques for use in the 
invention are generally known, or the materials can be 
prepared by known techniques. For example, several 

separately-derived embryonic stem (tS) cen une^ are 
available: see Manscur et al, Nature 23_S.:34£ ( 1983;; 
Schwartzberg et al, Science 246 ; 799 (1989;; Johnson etal, 
Science 24^:1234 ( 1989); and Z i j lstra et al , Nature 342 : 4 2 5 
- 1989). Transformation and selection procedures nave 
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already been established. Immunoglobulin genes cn YACs are 
available, as is a YAC containing the Factor IX gene; these 
clones have been obtained independently by screening 
various YAC libraries; see Little et al, PNAS USA 86:1598 
(1989) . The sizes for the YACs vary and are between 200 
and 600 kb. Cosmids may increase size to, say, 1.8 Mb. 

The methods concerned with the introduction of large 
DNA molecules into cells are microinjection and co- 
precipitation with calcium phosphate (using any naked DNA 
such as YACs, genomic DNA or dissected chromosomes) or 
protoplast fusion, using yeast protoplasts: see Oi et al, 
PNAS USA 80:825-829 (1983); Graham and van der Eb, Virol. 
52:456-658 (1973); and Richa and Lo , Science 245 : 175-177 
(1989). The approach of directly injecting large DNA 
15 molecules into fertilised eggs or ES cells was difficult 
because of the nature of DNA (large molecules are sticky) . 
It is preferred to introduce the various YACs by protoplast 
fusion; this, however, needs the introduction of a 
selective marker gene into the YAC (see below) . Another 
approach is the transfection of high molecular weight DNA 
or chromosomal DNA, mixed with selective marker DNA, into 
ES cells by calcium phosphate co-precipitation . 
Identification of integrated genes of interest could then 
be done by probing with human Alu repeat sequences, 
confirming integration and size, as well as with specific 
probes. This random approach is similar to a library- 
screening and depends on the transfection frequencies. 
Multiple copies of selective marker genes may thus be 
produced . 

It is a primary object of the invention to introduce 
larae DNA molecules inro the germ line of mice or other 
non-human animals, e.g. via ES cells. For the purposes of 
this invention, such large molecules should be introduced 
in germ line configuration. By introducing the?, into ES 
3 5 ceils, the germ line locus, e.g. fcr antibodies cr 
immunoglobulins, rearranges in the lymphoid tissue cf the 
animal, and thus antibody production takes place. 
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Techniaues outlined acove make use of YACs tnat 
contain, fcr example, well-character ised parts of the human 
immunoglobulin light and heavy chain loci or the Factor IX 
gene. In addition, high molecular weight DNA carrying any 
5 (large) genes of interest may be transferred into a variety 
of cells, including ES cells which can and have been used 
to obtain transgenic mice. 

The purpose of introducing coding sequences and 
flanking regions on large DNA molecules is to preserve the 

10 original genomic context which facilitates the correct 
expression. These techniques also allow the introduction 
and study cf large gene families. In combination with gene 
targeting, authentic foreign proteins can be obtained in 
larae yield without interference cf the homologous 

15 endogenous gene products. In that way, an animal may be 
obtained with an immune system (in respect of antibody 
production) indistinguishable from that of man. 

The endogenous mouse antibodies may interfere with the 
transgenic human immunoglobulins. It may therefore be 

20 necessary to silence the mouse immunoglobulin- heavy and 
light chain loci by gene targeting in ES cells. A mouse 
strain can then be obtained with a transgenic human 
immunoglobulin heavy and light chain gene cluster, that 
makes no antibodies cf its own. 

25 in a particular example of the invention, a neomycin 

resistance cassette was integrated into a large (300 kb) 
YAC. The YAC contains a well-characterised region of the 
human immunoglobulin (Ig) kappa (k) light chain locus. The 
modified IGk YAC was transferred into embryonic stem (ES) 

30 ceil lines by spheroplast fusion. The approach is useful 
for the transfer and expression of large genes and gene 
families in their original genomic context into the 
germline of other species via ES cells. 

The k locus in humans is estimated to spread over 2 500 

3 5 thousand base-pairs of DNA. The genes of the 

immunoglobulin k iicht chair, are assembled during E-ceil 
differentiation bv somatic recombination: one cf the ma 
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(V=variable) gene segments rearranges with one of the 
five J K (J=joining) segments, and a C K (C-constant region) 
polypeptide is transcribed. In order to investigate the 
expression of such complex loci, it is essential to 
introduce very large fragments, containing many gene 
segments as well as necessary regulatory sequences, into 

cells and animals. 

ES cells are suitable for gene manipulation 
experiments such as the introduction of large DNA molecules 
on YACs. Furthermore, ES cells can be reintroduced into 
blastocysts, and chimaeric and germline mice which carry 
the introduced loci can be derived from them. 

The invention is further illustrated by the specific 
description that follows, and also by the Materials and 
Methods section of Davies et al , Nucleic Acids Research 
20(11) : 2693-2698 (1992). The contents of that article are 
incorporated herein by reference. 

The YAC containing the human k locus does not contain 
a marker gene which allows selection in ES cells. 
Therefore, in accordance with one embodiment of the 
invention, the neomycin resistance gene (neo) was 
integrated into the YAC. The neo gene permits selection of 
stable clones when transferred into mammalian cells. An 
example of the yeast-selectable marker gene that is also 
usually used is LYS2 which allows growth in lysine- 
deficient yeast media. Integration of exogenous DNA in 
yeast proceeds in an homologous fashion, and introducing 
new sequences into YACs, termed "retrofitting 11 by Eliceiri 
et al, PNAS USA 88: 2179-2183 (1991) is facilitated because 
of the plasmid homology region of the YAC arms. 

Targeted integration in yeast is either performed 
using replacement constructs or integration vectors; see 
Scherer et al, PNAS USA 76:4951-4955 (1979) ; Pavan et al, 
Mol. Cell. Biol. 10:4163-4169 (1990), and PNAS USA 8_8:778S- 
7791 (1991) ; and Srivastava ez al, supra . Replacement 
vectors are designed to disrupt a region of homology by the 
insertion of exogenous DNA. In the particular example, 
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homologous integration into YACs was studied using 
integration vectors which recombine as a whole and 
duplicate a given target sequence without impairing its 
function. Integration vectors can be rescued from YACs and 
also from transfected cells, as the bacterial sequences 
allow subcloning in bacteria, for example, and this permits 
the isolation of flanking insert DNA. The universal 
principle of integration using a set of vectors, several 
targeting sites and different YACs is shown. The modified 
YACs have been stably introduced into embryonic stem cells: 
thus, the experiments show the feasibility of transferring 
complex gene loci from one species to another. 

An important aspect of the evidence is that YACs can 
be used to transfer such loci into other cells, without 
transferring potentially-undesirable yeast DNA, while 
retaining the locus in its essential configuration and 
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CLAIMS 

1. A cell that is not prokaryotic or a yeast cell, 
transformed with a foreign gene or gene locus of at least 
100 kb and also a marker gene which allows selection in the 

5 or another such cell. 

2. A cell according to claim 1, which is an embryonic 
stem cell. 

3. A cell according to claim 1 or claim 2, wherein the 
marker gene is in addition to an ampicillin-resistance or 

10 other prokaryotic selective marker gene and/or uracil- 
resistance or other yeast-selective marker gene, 

4. A cell according to any preceding claim, comprising at 
least two copies of the or each marker gene. 

5. A cell according to any preceding claim, wherein the 
15 foreign gene or gene locus is in germ line configuration. 

6. A cell according to any preceding claim, essentially 
free of yeast DNA. 

7. A yeast artificial chromosome (YAC) including a 
foreign gene or gene locus of at least 100 kb and also a 

20 marker gene which allows selection in cells that are not 
prokaryotic or yeast cells. 

8. A YAC according to claim 7, wherein the marker is for 
resistance to neomycin, hygromycin or HPRT. 

9. A YAC according to claim 7 or claim' 8, wherein the 
25 foreign gene or gene locus and/or the marker gene are as 

defined in any of claims 3 to 5. 

10. A transgenic non-human animal including a foreign gene 
or gene locus of at least 100 kb. 

11. A transgenic non-human animal capable of expressing a 
30 the product of another animal but not its own corresponding 

product . 

12. An animal according to claim 11, wherein said another 
animal is human. 

13. An animal according to claim 10 and also claim 11 or 
35 claim 12. 

14. An animal according to any of claims 10 to 13, which 
is a lactating animal and the gene product is isolatable 
from its milk. 
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